Scene change detection by audio and video clues
Abstract
Automatic video scene change detection is a challenging task. Audio or visual information alone often cannot provide a satisfactory solution. Combining the two efficiently remains difficult, however, because the relationship between audio and visual content varies widely across videos. In this paper, we present an effective scene change detection method based on the joint evaluation of audio and visual features. First, video information is used to find the shot boundaries. Second, audio features are extracted for each video shot. Finally, an audio-video combination scheme is proposed to detect the video scene boundaries.
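The three steps named in the abstract can be illustrated with a minimal sketch. The abstract does not specify the features or the combination rule, so everything concrete here is an assumption made for illustration: grey-level histogram differences for shot cuts, RMS energy and zero-crossing rate as audio features, and a rule that keeps a shot cut as a scene boundary only when the audio features also change. The function names and thresholds are hypothetical and are not taken from the paper.

```python
import numpy as np

def shot_boundaries(frames, threshold=0.5, bins=16):
    """Step 1 (assumed method): a new shot starts at frame i when the
    L1 distance between consecutive normalized histograms is large."""
    hists = []
    for frame in frames:
        h, _ = np.histogram(frame, bins=bins, range=(0, 256))
        hists.append(h / h.sum())
    cuts = []
    for i in range(1, len(hists)):
        if np.abs(hists[i] - hists[i - 1]).sum() > threshold:
            cuts.append(i)
    return cuts

def shot_audio_features(audio, sample_rate, fps, cuts, n_frames):
    """Step 2 (assumed features): RMS energy and zero-crossing rate
    of the audio samples that fall inside each shot."""
    edges = [0] + list(cuts) + [n_frames]
    samples_per_frame = sample_rate / fps
    feats = []
    for start, end in zip(edges[:-1], edges[1:]):
        seg = audio[int(start * samples_per_frame):int(end * samples_per_frame)]
        if seg.size < 2:
            # Too short to measure; treat as silence.
            feats.append((0.0, 0.0))
            continue
        rms = np.sqrt(np.mean(seg ** 2))
        zcr = np.mean(np.signbit(seg[1:]) != np.signbit(seg[:-1]))
        feats.append((rms, zcr))
    return np.array(feats)

def scene_boundaries(cuts, feats, audio_threshold=0.5):
    """Step 3 (assumed rule): keep a shot cut as a scene boundary only
    if some audio feature changes by a large relative amount across it."""
    scenes = []
    for k, cut in enumerate(cuts):
        before, after = feats[k], feats[k + 1]
        scale = np.maximum(np.abs(before), np.abs(after)) + 1e-12
        if (np.abs(before - after) / scale).max() > audio_threshold:
            scenes.append(cut)
    return scenes
```

With this rule, a visual cut between two shots that share the same soundtrack is treated as a shot change within one scene, while a cut accompanied by an audio change is reported as a scene boundary. Other combination rules are equally plausible given the abstract alone.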
Similar resources
Compressed Domain Scene Change Detection Based on Transform Units Distribution in High Efficiency Video Coding Standard
Scene change detection plays an important role in a number of video applications, including video indexing, searching, browsing, semantic features extraction, and, in general, pre-processing and post-processing operations. Several scene change detection methods have been proposed in different coding standards. Most of them use fixed thresholds for the similarity metrics to determine if there wa...
IEEE Transactions on Multimedia EDICS: 4-SEGM Enhanced Eigen-audioframes for Audio-visual Scene Change Detection
In this paper, a novel audio-visual scene change detection algorithm is presented and evaluated experimentally. An enhanced set of eigen-audioframes is created that is related to an audio signal subspace, where audio background changes are easily discovered. An analysis is presented that justifies why this subspace favors scene change detection. Additionally, a novel process is developed in ord...
Vodcast: A Breakthrough in Developing Incidental Vocabulary Learning
Incidental vocabulary learning is often seen as superior to direct instruction on many occasions. Meanwhile, upon the emergence of the World Wide Web, second language (SL) learners have been introduced to 'podcasts' (recorded audio and video online broadcasts) which could be authentic sources of vocabulary learning. The relatively recent phenomenon of video podcast (vodcast) might be considered...
Traffic Scene Analysis using Hierarchical Sparse Topical Coding
Analyzing motion patterns in traffic videos can be exploited directly to generate high-level descriptions of the video contents. Such descriptions may further be employed in different traffic applications such as traffic phase detection and abnormal event detection. One of the most recent and successful unsupervised methods for complex traffic scene analysis is based on topic models. In this pa...
Audio scene analysis and scene change detection in the MPEG compressed domain
The use of audio to retrieve and index the associated video is a relatively new approach. In this paper the focus is on MPEG video. For indexing and retrieval, one needs to segment the audio stream associated with the video in terms of gender, speech, music and the speaker. This is called "audio scene" analysis. The paper discusses techniques for such analysis in the MPEG audio compressed domain.